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(54) Title: GENE ENCODING LABYRINTHIN, A MARKER FOR CANCER 



(57) Abstract 

A cDNA molecule that encodes a protein designated 
Labyrinthin (Lab) is isolated and its nucleotide sequence 
is determined. The protein, or peptides derived from 
the protein, are markers useful to define novel classes of 
cancers. Diagnostic assays for these cancers use antibodies 
to Lab or nucleotide probes that hybridize with the lab 
gene or a fragment therefrom. Vaccines useful either to 
prevent recurrence of cancers in subjects who test positive 
for Lab (or lab), or to prevent initial occurrence of cancer, 
use proteins or peptides derived from Lab. Expression of 
Lab via immunogenic assays is used to monitor effects 
of cancer treatments. Antisense molecules against lab 
are used in treatments. Sense molecules of lab are used 
to restore lost lab function in diseased normal cells, for 
example, gland cells. 
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GENE ENCODING LABYRINTHIN, A MARKER FOR CANCER 
5 Inventor: James A. Radosevich, Ph-D. 

The invention relates to a gene encoding a protein and peptides therefrom that 
includes an epitope, a cancer associated antigen, useful as a marker that is not 
restricted to previously defined histological classes of cancer. Antigenic peptides are 
1 0 useful as a vaccine for treatment and prevention of cancer, and for the preparation of 
new, specific, monoclonal antibodies. Antisense molecules are useful in 
pharmaceutical compositions and are useful for diagnosis and treatment. 

BACKGROUND OF THE INVENTION 

15 

Cancer 1 is a leading cause of death in men and women throughout the world. 
In the United States alone, over I million new cases are diagnosed each year, and over 
0.5 million deaths are reported annually (Landis, et al, 1998). Historically, tumors 
are grouped and treated, based in part by the tissues in which they arise, e.g.- breast 

20 cancer, colon cancer, and lung cancer, and the like. Yet, within lung cancer, for 
example, it is well recognized that these tumors are a very heterogeneous group of 
neoplasms. This is also true for tumors arising in other tissues. In part, because of 
this heterogeneity, there are complex and inconsistent classification schemes which 
are used for human tumors. Previous attempts to treat cancer have been hampered by 

25 1 ) the arbitrary classification of tumors arising within given tissues, and 2) by using 
microscopic methods based on how these tumors look (histological classification). 
Although existing classifications for 



i 



Terminology used herein is as follows, "cancer is a malignant tumor, wherein a 
"tumor" is an abnormal mass of tissue, that need not be malignant. "Neoplasm" 
is a form of new growth. 
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various tumor types have some prognostic value, almost all of the 
classifications fall to predict responsiveness to treatments and likelihood of 
cure or disease course. Improved classification schemes based on the 
biological constitution of these neoplasms is required to significantly alter the 
5 survival statistics of humans who have cancer. One approach to solving 

these problems is to locate molecules specific to tumors, preferably antigens 
in molecules that are markers for cancer cells. (A "marker" is defined herein 
as any property which can be used to distinguish cancer from normal tissues 
and from other disease states.) The markers' presence is then a basis for 

10 classification. 

Monoclonal antibodies (MCAs) prepared by somatic cell hybridization 
techniques, usually in mice, are useful molecular probes for the detection and 
discrimination of cellular antigens, and therefore have great potential for 
detecting cancer associated antigens. These antibodies bind to specific 

1 5 antigens and the binding is detectable by well known methods. When 

binding occurs, the inference is made that a specific antigen is present. 
Those cancer associated antigens which are exposed to the cell surface or 
found in the cancer mass, are molecular targets for the immune systems 
(including host antibodies) of the host. Recent findings suggest that cancer 

20 patients who have antibodies against their tumors, do better than those who 

do not mount this type of immune response (Livingston, et a/., 1994). 
Therefore, natural, induced, or administered antibodies are a promising 
therapeutic approach. 

The humanization of non-human MCAs (the process by which 

25 non-human MCA reactive sites are shuttled into cloned human antibodies 

and expressed) results in reduced immunogenicity of the foreign antibodies 
without the loss of their specific binding in in vivo and in ex vivo applications. 
MCAs can be used as in vivo imaginfl agantu, diegnostle testi, and fer 
mtrapy (Ruflaitviah, eitil, 1988, 1990| Resen, etal. 1986). 

30 Vaccine therapy is a well established approach directed at inducing an 

immune response without exposure to the causative agent of a disease or 
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condition. Many vaccines are available, for example, to stimulate a response 
in a host to bacterial and viral agents. The use of tumor associated antigens 
(markers) in a vaccine could prevent primary cancer occurrence, and could 
also provide a means to prevent recurrence of the disease. 

5 Gene therapy is a means by which the genetic make-up of cells is 

modified to express the gene of interest. There are many forms of gene 
therapy including: gene replacement, antisense suppression therapy, and 
surrogate gene expression. Discovering genes encoding cancer-associated, 
preferably cancer-specific antigens (markers) opens the door to genetic 

10 intervention against cancer cell proliferation. The accurate and consistent 

use of a cancer marker to differentiate cancerous from normal tissue, not only 
has diagnostic potential, but is also desirable for treatment and prognosis. 
Therefore, such markers have been sought. 

Recent studies have shown that the enzyme encoding human aspartyl 

15 beta-hydroxylase (HAAH) is overexpressed in some human adenocarcinoma 

cell lines, and in primary hepatocellular cancers, therefore could be a marker. 
The gene said to encode HAAH has been cloned and sequenced (Gronke, 
era/., 1989, 1990; Wang, ef a/., 1991; Jia, era/., 1992, 1994; Korioth, era/., 
1994; Lavaissiere, era/., 1996). However, little is known about HAAH 

20 expression in human tumors in general (Lavaissiere, ef a/., 1996). 

The study of the HAAH enzyme grew out of the study of its bovine 
counterpart (Gronke, ef a/., 1989, 1990; Wang, ef a/., 1991; Jia, ef a/., 1992). 
Bovine aspartyl beta-hydrqxylase is an intracellular, glycosylated protein, 
localized in the rough endoplasmic reticulum. The protein has been reported 

25 to have three major species of molecules; a 85 kilodalton form, and two active 

forms with molecular weights of 56 and 52 kilodaltons respectively 
(Lavaissiere, ef a/., 1996). 

Using standard biochemical methods, bovine aspartyl 
beta-hydroxylase (bAAH) has been purified and characterized ( Gronke, et al. 

30 1990; Wang, ef a/., 1991 ). The activity of the enzyme has been shown to be 

correlated with the 52 and 56 kilodalton species which were purified. 
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Immunologically, a related higher molecular weight form (85-90 kilodalton) was also 
observed. As part of the purification, bAAH is bound to Con A sepharose, which is 
consistent with the conclusion that the enzyme is glycosylated. (Subsequent reports 
on the DNA sequence show three possible glycosylation sites, with one site being 
5 very close to the known active enzyme domain.) The protein is very acidic in nature, 
and a detergent is not required to solubilize the active fraction. The active enzyme 
site is dependent from the biochemically isolated bovine protein (bAAH) on the 
presence of histidine at position 675 (Jia, et al, 1994). 

A partial amino acid sequence was obtained for HAAH. DNA probes (a DNA 
10 probe is a molecule having a nucleotide sequence that is capable of binding to a 

specified nucleotide sequence under certain conditions) deduced from this amino acid 
sequence was used to screen a bovine cDNA library (Jia, et al, 1992). (A cDNA 
library contains the sections of DNA that encode for gene products, e.g. peptides as 
opposed to genomic DNA). Several overlapping cDNA sequences in the library 
1 5 contained a 764 amino acid open reading frame (ORF) sequence which will be 

expected to encode an 85 kilodalton protein. Also present in this ORF sequence were 
two other possible start codons, that is, locations at which encoding begins. The most 
3' start codon was preceded by a ribosome binding site. Translation of the clone 
having this sequence resulted in a protein that was about 85 kilodaltons. Antiserum 
20 was raised to the membrane fraction of human MG-63 calls and was used to 
immunoscreen a cDNA library made from MG-63 cells. Data on one clone was 
reported which could encode a 757 amino acid protein, and, by sequence analysis, was 
found to have strong N-terrninal homology with bAAH (Korioth, et al, 1994). When 
this clone was used in an in vitro translation system (an artificial cocktail of normal 
25 cell cytoplasm used to convert mRNA into protein), a 56 kilodalton protein was 
produced. It was suggested that this was due to posttranslational cleavage. 

The HAAH enzyme is responsible for the modification of specific aspartic 
acid residues within the epidermal growth factor-like domains of 
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proteins. It has been hypothesized that these modified aspartic acid residues 
allow the epidermal growth factor-like domains to become calcium binding 
domains. (Gronke, etal., 1989, 1990; Wang, ef a/., 1991; Jia, ef a/., 1992, 
1994; Korioth, etal., 1994; Lavaissiere, etal., 1996). 

An enzyme related to HAAH, aspartyl beta-hydroxylase (MH), was 
first studied because it specifically modified select aspartic acid or 
asparagine residues in a group of biologically important proteins including the 
vitamin K-dependent coagulation factors VII, IX, and X. Other proteins like C, 
S, and Z also have this modification (Gronke, ef a/., 1989, 1990; Wang, etal., 
1991; Jia, etal., 1992, 1994; Korioth, etal., 1994; Lavaissiere, era/., 1996). 
Aspartic acid and asparagine residues have been shown to be modified by 
HAAH in proteins containing epidermal growth factor-like domains. The 
function of the beta-hydroxyaspartic and beta-hydroxyasparagine residues is 
unknown, however, it has been speculated that this modification is required 
for calcium binding in the epidermal growth factor EGF-like domains of 
selected proteins. 

Antibodies were raised to human hepatocellular carcinoma FOCUS 
cells (Lavaissiere, ef a/., 1 990). One MCA reacted with an antigen that was 
highly expressed in hepatocellular carcinomas (Lavaissiere, ef a/., 1996). 
Immunoscreening using this antibody and a lambda gt1 1 HepG2 library 
resulted in the isolation of a partial cDNA, which was subsequently used to 
isolate a larger clone. 

A human adenocarcinoma cell line designated A549 was reported as 
having very high levels of HAAH activity (Lavaissiere, etal., 1996). A mouse 
monoclonal antibody designated MCA44-3A6 (U.S. Patent No. 4,816,402) 
was produced against the human adenocarcinoma cell line A549 (ATCC 
accession number CCL 185) (Radosevich, ef a/., 1985). The antibody 
recognized a cell surface, non-glycosylated antigenic protein having an 
estimated apparent molecular weight of 40 kDa). 

The antigen was expressed by A549 cells, and was found to be a good 
adenocarcinoma marker; that is, it was frequently expressed by cancers 
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which looked like adenocarcinomas when examined histologically 
(Radosevich, eta!., 1990a; Lee, et a/., 1985). MCA44-3A6 is unique in that it 
is the first monoclonal antibody which has this binding specificity. The results 
from an International Workshop for Lung cancer confirmed other related 
published findings on MCA44-3A6 (Stahel, 1994). 

The antibody designated MCA 44-3A6 has clinical utility because it 
differentiates antigens associated with adenocarcinomas. The normal and 
fetal tissue distribution of the antigen is restricted to some glandular tissues 
(Radosevich, et a/., 1991 ). Detection can occur on formalin fixed-paraffin 
embedded tissue (Radosevich, eta!., 1985, 1988, 1990a, 1990b; Lee, era/., 
1985, 1986; Piehl, era/. 1988; Combs, etal., 1988b, 1988c; Banner, era/., 
1985). The antibody has a restricted binding pattern within human pulmonary 
tumors (Lee, era/., 1985; Banner, etal., 1985; Radosevich, etal., 1990a, 
1990b). 

In a study of over two hundred pulmonary cancers, MCA 44-3A6 was 
found to react with all of the adenocarcinomas tested, many of the large cell 
carcinomas, as well as with subsets of intermediate neuroendocrine small cell 
lung cancers, well-differentiated neuroendocrine small cell carcinomas, 
carcinoids, but not mesotheliomas. MCA 44-3A6 does not react with 
squamous cell carcinoma, bronchioloalveolar carcinoma, or small cell 
carcinoma (Lee, et a/., 1985). MCA 44-3A6 is useful in distinguishing 
adenocarcinomas that are metastatic to the pleura from mesothelioma (Lee, 
et at., 1986). The antibody has selected reactivity among adenocarcinomas 
and in large cell carcinomas (Piehl, etal., 1988; Radosevich, era/., 1990b). 

In a study of over 40 cases of lung cancer comparing cytological and 
histological findings, MCA 44-3A6 was found to be useful in cytological 
diagnosis and was consistent with the histological finding (Banner, et a/., 
1985). Histology is the study of tissues (which are made of cells). Cytology 
II mo itufiy of eolli whleh have been removed frem the organizational context 
which is commonly referred to as tissue. Cells removed from tissues do not 
always behave the same as if they were in the tissue from which they were 
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derived. Fortunately, the antigen detected by MCA 44-3A6 expressed in 
adenocarcinoma cells in tissue behaves in the same ways as 
adenocarcinoma cells removed from tissues. This is a very diagnostically 
important characteristic. Similar correlations using cytoiogically prepared cell 

5 blocks of pulmonary carcinomas, as well as ACs presenting in body fluids 

from other sites throughout the body were demonstrated (Lee, et a/., 1985; 
Spagnolo, etal., 1991; Combs, era/., 1988c). Also, MCA44-3A6 binds to 
adenocarcinomas from sites other than lung cancer. The expression of the 
antigen in primary and metastatic lesions was also reported ( Combs, et al., 

1 0 1 988a). The utility of the MCA antibody in differentiating cancer from benign 

lesions in human breast tissue was also noted (Duda, etal., 1991). 

The cellular localization of the antigen detected by MCA 44-3A6 was 
determined. By using live cell radioimmunoassays (a radioactive antibody 
test directed at determining binding of the antibody to live cells), 

15 immunofluorescence, and live cell fluorescence activated cell sorter (FACS) 

analysis, the antigen detected by MCA 44-3A6, was shown to be on the 
outside surface of the cell (Radosevich, etal., 1985). Additional studies 
using immunogold-electron microscopy and FACS analysis have 
demonstrated that this antigen is non-modulated (that is not internalized by 

20 the cancer cell when bound by an antibody), is expressed on the extracellular 

surface of the plasma membrane, and is not cell cycle specific that is, the cell 
makes protein all the time it is going through the process of cell replication, 
and also when it is not dividing (Radosevich, etal., 1991). The antigen is not 
found in the serum of normal or tumor bearing patients, and is not shed into 

25 the culture media by positive cell lines (that is, cancer cells are known to bleb 

off portions of their cell membranes and release them into the surrounding 
fluid.) (Radosevich, etal., 1985). Recently 3 of 27 randomly tested 
adenocarcinoma patients were found to have naturally occurring antibodies 
to the antigen. In addition, radiolabeled MCA 44-3A6 was used to localize 

30 A549 tumors growing in nude mice. A douxorubicin immunoconjugate 

MCA 44-3A6 is selectively toxic in vitro (Sinkule, et al., 1 991 ). 
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Determination of the nucleotide and amino acid sequences of the antigen 
detected by MCA 44-3 A6 would enhance the usefulness of this antigen in cancer 
diagnosis, treatment and prevention. 

5 BRIEF SUMMARY OF THE INVENTION 

The antigen detected by the antibody MCA 44-3 A6 as described in the 
Background is now designated as 'Xabyrinthin." A gene (designated labyrinthin; 
abbreviated lab) characterized by a unique nucleotide sequence that encodes the 
antigen detected by MCA 44-3A6 was isolated and characterized, (lab notation 
10 signifies the nucleic DNA/RNA forms; "Lab" notation refers to the protein which is 
encoded by the lab DNA/RNA). 

The invention described herein used the antibody MCA 44-3 A6 as a tool to 
clone the gene encoding Lab. In addition, an epitope (the necessary binding site for 
an antibody found on the antigen) for MCA 44-3A6 was identified on the Lab protein 
1 5 expressed by the clone to be PTGEPQ. 2 The epitope represents an important 

irnmunodorninent sequence; that is, when injected into animals, the animals readily 
produce antibodies to this sequence. 

An aspect of the invention is the use of lab DNA in the sense 3 expression 
mode for: I) the marking of human tumors by nucleotide probes; 2) the detection of 
20 DNA and mRNA expression of lab in cells and tissues; 3) the transformation of cells 
into a glandular-like cell type; 4) the production of Lab antigen in vivo for 
immunization; 5) the ex vivo expression of Lab for immunization to produce 
antibodies; and 6) production of Lab in vitro. Use of an antisense molecule, e.g. by 
production of a mRNA or DNA strand in the reverse orientation to a sense molecule, 
25 to suppress the growth of /afryn/i^m-expressing (cancerous) cells is another aspect o 
the invention. 



2 Standard abbreviations for amino acids. 

3 The normal transcription of a DNA sequence which proceeds from the 3' to the 
5' end to produce a mRNA strand from the sense strand of DNA, the mRNA 
being complementary to the DNA. 
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An aspect of the invention is a vector comprising a DNA molecule with a 
nucleotide sequence encoding at least an epitope of the Lab antigen, and suitable 
regulatory sequences to allow expression in a host cell. 

Another aspect of the invention is an amino acid sequence deduced from the 
5 protein coding region of the lab gene. Selected regions of the sequence were found 
via immunological methods, to produce effects corresponding to effects from both 
naturally occurring (from cancer cells), chemically produced (synthetically produced 
peptides), and expression products of the cloned lab gene. 

Another aspect of the invention is the use of the entire deduced amino acid 
10 sequence of Lab, peptides derived from Lab, or chemically produced (synthetic) Lab 
peptides, or any combination of these molecules, for use in the preparation of vaccines 
to prevent human cancers and/or to treat humans with cancer. For purposes of the 
present invention, "humans with cancer" are those persons who have the Lab antigen 
detected in their cells. These preparations may also be used to prevent patients from 
1 5 ever having these tumors prior to their first occurrence. 

Monoclonal antibodies directed to the Lab protein, or antigenic components or 
derivatives of Lab proteins, are useful for detection of Lab and for other purposes. 
Monoclonal antibodies which are made in species other than those which react with 
the Lab antigen can be modified by a number of molecular cloning methods such that 
20 they retain their binding with the Labyrinthin peptides, yet are not immunogenic in 
humans (Sastry, et al, 1989; Sambrook, et al, 1990). In brief, this is done by 
replacing the binding site sequence of a cloned human antibody gene, with the 
binding site sequence of the non-human monoclonal antibody of interest. These 
"humanized" MCAs are used as therapeutic and diagnostic reagents, in vivo, ex vivo, 

25 and in vitro. 

The use of the Lab protein or antigenic peptides derived therefrom in 
diagnostic assays for cancer is a way to monitor patients for the presence and amount 
of antibody that they have in their blood or other body fluids or tissue. This detection 
is not limited to cancers of a class or classes previously defined, but is useful for 

3 0 cancer cells that have the Lab marker 
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antigen. The degree of seroconversion, as measured by techniques known to those of 
skill in the art [e.g., ELISA (Engrail and Perlmann, 1971)] may be used to monitor 

treatment effects. 

Treatment with antisense molecules to lab or antibodies to Lab in a 
5 pharmaceutical composition is an approach to treat patients who have Lab in, or on, 
their cancer cells. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is the nucleic acid sequence of the lab gene. 
10 FIG. 2 is the amino acid sequence for Lab, deduced from the lab gene. 

FIG. 3 is an illustration of the lab gene and how it is related to the HAAH 
enzyme. 

DETAILED DESCRIPTION OF THE INVENTION 
1 5 Molecular Biolnpv of Lahvrinthin : To demonstrate that the epitope MCA 

44-3 A6 is encoded by a protein sequence, high molecular weight DNA from the cell 
line A549 was isolated. This DNA was co-precipitated (via calcium) with a plasmid 
(pSVneo), and used to transfect a mouse cell line designated B78H1 cells (Albino, et 
al, 1985). This mouse cell line is negative for the expression of the epitope and was 
20 reported to have a high frequency of incorporation and expression of any human DNA 
sequences. If a given B78H1 cell was in a state to take up DNA, it would be expected 
to have taken up both human DNA and the plasmid DNA. The plasmid DNA makes 
the cell resistant to G418 (a normally toxic drug). Therefore, if a cell normally 
sensitive to G418 growth inhibitor grows in G418, it had to have taken up the 
25 plasmid, and may also have taken up one or more A549 DNA sequences. After G41 8 
selection (a way of choosing only cells which have resistance to growth in G41 8 by 
the uptake/expression of the Neo gene on pSVneo plasmid, and therefore representing 
cells that were in a state to uptake other DNA at the same time), approximately 15 of 
1x10 s clones were detected using immunoselection with MCA 44-3A6. This finding 
30 is consistent 
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with a conclusion that human A549 cells have DNA that encodes Lab and 
possesses the regulatory sequences necessary for the expression of Lab. 

Comparison of HAAH and Labvrinthin: Because the DNA sequence of 
lab was determined as an aspect of the present invention, HAAH and lab 

5 coutd be compared. HAAH and the lab nucleotide sequences have some 

internal fragment similarities, but are different on either side of the fragment, 
and are related to different products. This conclusion is based in part by the 
analysis and homology of the DNA sequences reported for these two genes. 
Specifically, the lab 5' region has no homology with HAAH. The protein 

1 0 coding region of lab has about a 99.6% homology with an internal segment of 

the proposed protein coding region for HAAH. The 3' region has no 
homology with the HAAH reported sequence. Virtually all of the other data 
comparing HAAH and labyrinthin are different, for example: 1) molecular 
weights of the proteins, 2) cellular localization, 3) chromosome localization, 

15 4) histological presentation in normal tissues and tumors, 5) northern blot 

expression, 6) immunological findings. 

Although the protein coding region of lab is identical to an internal 
region of the sequence reported for HAAH, the 5' untranslated region of 
HAAH is different, and part of the 5' translated protein coding region of HAAH 

20 is missing from that found in the lab clone. From both HAAH and lab clones, 

the deduced protein would be expected to be very acidic in nature, and 
therefore would run anomoiously in SDS gels. As predicted, the Lab protein 
migrates anomoiously in SDS gels. What was cloned and disclosed in the 
present invention migrates identically to the native protein found in several 

25 cell lines. Convincing evidence that the correct gene fragment encoding the 

antigen detected by MCA 44-3A6 has been cloned (mRNA) is that when the 
recombinant protein is made, that recombinant protein should act (in this 
case - have an apparent molecular weight) the same as independent 
biologically derived source of that protein. Lab provided from clones has the 

30 characteristics of Lab from cells. 
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The deduced amino acid sequence encoded by HAAH requires the use of an 
open reading frame which would produce a protein that is 85-90 kilodaltons, and does 
not take into account that there are several start codons and other shorter open reading 
frames. The deduced HAAH protein (biochemically) is glycosylated and the reported 
5 sequence has glycosylation sites ( Korioth, et al., 1994; Lavaisslere, et al, 1996). To 
the contrary, Lab is not glycosylated, nor does it have predicted glycosylated sites. 

The deduced HAAH amino acid sequence contains a region shared by the Lab 
amino acid sequence which is predicted to be very hydrophobic. Lab requires strong 
detergents in order to be soluble; HAAH does not. The increased expression of 
10 HAAH (by enzyme activity measurements) in the same cell line (A549) which was 
used to clone and study lab extensively, suggests that both of these gene products may 
be important to the AC phenotype and that at least A549 cells make both functional 
HAAH and Lab. Successful transfections of the antisense to lab into A549 resulted in 
a marked decrease in expression of lab and in the growth rate of the cells. The 
15 expression of a sense lab construct in NIH-3T3 cells (normal mouse fibroblasts) 
resulted in a marked change in phenotype, a phenotype consistent with that of ACs. 
Therefore, lab expression is associated with conversion of normal cells to cancerous 
cells. Lab and HAAH have potential calcium binding domains in common. 

r.HNA Library ^nntniction and Cloning : A cDNA lambda gtll phage library 
20 was constructed using mRNA which was isolated from actively growing A549 cells 
(Sambrook, et al, 1990). This oligo(dT)-primed cDNA was cloned into the Eco RI 
site using Eco RI linkers. The library has about 83% clear (containing an insert) 
plaques with a titer of 1 .2 x 1 0 ,0 /ml representing a minimum of 1 .46 x 1 0 6 
independent plaques which, by Polymerase Chain Reaction, have insert sizes ranging 
25 from 0.6 to 5 kilobases. Since Lab is a 40 kilodalton integral protein, (a protein which 
is embedded in the plasma membrane) the theoretical full length mRNA encoding this 
protein, including a potential leader sequence is estimated to be about 1 .1 kilobases. 
This library was 
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immunoscreened using the antibody MCA 44-3A6. Eight independently derived 
phage stocks (identical phage which are from the same plaque) were isolated. These 
have all been plaque purified by repeated cycles of immunoscreening/isolation. Upon 
Eco RI digestion of these eight isolates, inserts of about 2kb were seen. The largest 
5 insert was isolated (2A1A1) and the Eco RI fragment was cloned into the pGEM-3Z 
plasmid. 

Sequencing and Sequence Analysis : The DNA fragment designated 2A1A1 
was found to have an insert of 2442 base pairs in length (FIG. 1), containing a 5' 
untranslated region, a ribosome binding site, and a start codon which would be 
10 expected to encode a 255 amino acid protein (FIG. 2). The 3' untranslated region is 
remarkable in that it contains only four instability sequences; ATTTA (Xu, et al, 
1997). In addition there are sequences found in the very 3' end of mRNA's which 
result in adenylation of the mRNA (Sambrook, et al, 1990). The lab sequence 
contains both a sub-optimal (ATT AAA) and optimal (AATAAA) poly-adenylation 
1 5 site. These are sequences found in the very 3' end of mRNA's which result in 

adenylation of the mRNA. This rinding provides molecular data which supports the 
cellular and biochemical data that has been outlined herein. (The HAAH clone has a 
poly A signal, but the whole 3' region has not been sequenced.) 

A calcium binding site motif is noted in the Lab amino and sequence (FIG. 2), 
20 however, it is out of the known required structural context to be a binding site. In this 
case, the calcium limiting sequence is there, but it is not in a protein sequence context 
that is known to make it work as a binding site. Homology was noted with lab and an 
EST clone (designated #05501) which represented only a portion of the 3' 
untranslated region and independently confirmed this portion of the sequence. Some 
25 internal fragment homology is also noted with HAAH, but the 5' untranslated and part 
of the 5" translated region is different (58 amino acids), as well as a major portion of 
the 3' coding region is missing in lab (FIG. 3). 

Genomic DNA Tinning and Analysis : Using a PCR fragment representing the 
protein coding region of lab as a probe, a genomic lambda 
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FIX II library made from the human pulmonary fibroblast cell line WI-38 was 
screened. Ten primary plaques were isolated out of approximately 1x10 6 
screened plaques. Using seven of these as target DNA, Polymerase Chain 
Reaction conditions were established with primers for the protein coding 

5 region, producing a 765 base pair fragment, the expected protein coding 

region for lab. On Northern blots (a method used to qualitatively assess 
mRNA) lab only detects one band noted at 2.7 kilobases. The recombinant 
protein made from the lab clone, when tested on Western blots (a method 
used to qualitatively define proteins) using MCA 44-3A6, has the same 

1 0 relative mobility as the Lab protein when made by A549 cells. 

Lab and HAAH genes give different results in the proteins they 
encode. HAAH consistently gives two bands on Northern blot analysis (2.6 
and 4.3 kilobases) suggesting that the 2.6 kilobase band is due to alternative 
splicing, i.e. the cell cuts and splices the mRNA. Also, if lab and HAAH are 

1 5 the same gene, HAAH should be detected in all tissues and cancer cell lines 

in which Lab is found. However, Lab is not seen on Northern blots of cell 
lines EMT6 or QU-DB, nor is there immunoreactivity in these cells; indicating 
that Lab mRNA is not made, and that Lab protein is not produced in these 
cells. Lab protein is rarely expressed in normal cells, where both the HAAH 

20 mRNA and HAAH protein have been reported to be expressed in almost 

every tissue studied. 

mRNA Analysis: Northern blot analysis of the DNA fragment from the 
A549 cell line using lab cDNA as a probe identified a single band of about 2.7 
kilobases. This is expected based on the cDNA (2442 base pairs) and a 

25 poly-A tail of about 300 base pairs. Northern blot analysis of the mouse cell 

line, EMT6, and of the human large cell carcinoma cell line, QU-DB, confirm 
that no transcript for lab is produced by these cells. This is consistent with 
immunoassays which are negative for lab expression on these cells. 

Antisense and Sense cDNA Expression: The plasmld (pBK-CMV) 

30 (Sambrook, et a/., 1990) may carry either the sense or antisense full length 

cDNA lab into A549 and NIH 3T3 cells. An antisense molecule can be, for 
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example, a complementary sequence to a sense molecule that hybridizes 
with the sense molecule, preventing its expression. Using the MTT assay 
(Siddique, etal., 1992) to assess the growth rate of A549 cells expressing 
antisense to lab, a marked reduction in growth rate was noted. The antisense 
5 transfected A549 cells appear to have a greater degree of contact inhibition. 

A detectable amount of Lab is reduced in these antisense transfected cells. 
NIH-3T3 cells convert from a fibroblast-like cell type morphology (large, thin 
spindle shaped) to a large, adenocarcinoma appearing cells (very round, 
plump) when sense expression occurs. 

1 0 Chromosome Localization: The chromosome localization for lab, using 

full length cDNA as a probe via in situ hybridization (Sambrook, et al. 1990) is 
tentatively on chromosome 2q12-14, with possibly some reactivity to 
chromosomes 4 and 8. Using the same probe (the full length cDNA 
sequence of lab) and FACS sorted chromosomes (Lebo, etal. 1985) staining 

1 5 was also noted on chromosome 2, with weak staining on 4 and none on 8. 

The use of genomic clones will be of particular value in resolving these data 
because higher stringency hybridization conditions than that allowable for the 
cDNA, can be used, thereby reducing background signals. This is yet 
another proof that the correct gene was cloned and that the results are not 

20 due to a method artifact. There may be mutations in the genomic DNA of 

tumors and for the present invention, DNA was cloned from tumor cells 
(A549). Therefore, a mutated gene could have been cloned. However, that 
is not the case because the genomic DNA from a normal cell (DNA) produced 
the same sequence as what cloned as described herein. Therefore, a normal 

25 gene was cloned from A549 cells. The weak signals on chromosomes 4 and 

8 are consistent with a pseudogene or a related gene. For example, HAAH 
has been reported to be on chromosome 8q12 by in situ hybridization, so this 
result on chromosome 8 could reflect the HAAH and lab sequence homology. 
Pratain Molecular Charaatsri gatlon of Lab»rinthln> Previous work using 

30 Western blot analysis (a qualitative assay to assess antigens) has shown that 

the Lab antigen is a 40 kilodalton (by relative mobility) protein detectable in 
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A549 cells (Radosevich, et al., 1985). The epitope does not appear to be modulated 
or blocked by lectins, and is selectively expressed on the cell surface, primarily 
localized to the plasma membrane. (Radosevich, et al, 1985, 1991). Lab is sensitive 
to proteases, but not lipid or carbohydrate altering reactions (Radosevich, et al., 
5 1985). The biochemical properties of Lab are consistent with Lab being an integral 
membrane protein. 

Having a deduced amino acid sequence from the lab gene of the present 
invention, allows further characterization of the Lab protein. Extensive computer 
analysis of Lab has identified a eukaryotic leader-like sequence and theoretical 
1 0 cleavage site, 3 myristylation sequence sites, a weak membrane anchoring domain 
(MAD I), and a strong membrane anchoring domain (MAD II) (FIG. 2). [(In the 
HAAH sequence, there are 58 (theoretical) amino acids followed by a sequence 
homology in the Lab protein coding sequence and an additional 445 amino acid 3' to 
the lab sequence.)] 

1 5 When Lab is expressed as a fusion protein in a bacterial GST fusion 

expression system (pGEMEX-2T) (Amereham Pharmacia Biotech, Inc., Piscataway, 
New Jersey, 08854, USA), and subjected to Western blot analysis using the antibody 
MCA 44-3 A6, the resulting blots demonstrate that the expressed cleaved fusion 
protein has the same relative mobility as the protein detected in A549 cells. The 
20 deduced molecular weight for Lab is 28.8 kilodaltons and on Western blots it has a 
relative mobility identical to the form expressed by A549 cells (apparent relative 
mobility = 40 kilodaltons). The 55 glutamic and 27 aspartic acid residues a (82 
residues combined) are almost uniformly distributed throughout the protein (255 
amino acids total; 228 no leader sequence), except for the leader sequence and the 
25 strongest membrane anchoring domain (MAD II). These data suggest that Lab 

migrates anomelously in SDS gels. Cell lines other than A549 (e.g. adenocarcinomas 
DU-145, ATCC #HTB-81; ZR-75-1, ATCC # CRL-1504, and so forth) have an 
antigen detected with the same molecular weight antigen as Lab. Neither a 85-90 
kilodalton molecular weight species, nor a 
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52 and 56 kilodalton molecular weight species is noted when probing 
Western blots for Lab. 

Epitope Mapping Using the Antibody MCA 44-3A6 and Vaccine 
Feasibility of Lab: Using Polymerase Chain Reaction and the GST fusion 
protein system, subclones of the protein coding region were made, and 
epitopes mapped the binding of MCA 44-3A6 to six amino acids (PTGEPQ) 
representing amino acids #1 17-122 of Lab ("P" peptide). In order to 
determine this epitope, the entire coding region was divided into regions, 
Polymerase Chain Reaction primers were designed to amplify each region, 
and the subsequent expression of Polymerase Chain Reaction products were 
cloned and tested by Western blot analysis using the antibody MCA 44-3A6. 

The DNA fragment representing the positive Western blot result was 
then further subdivided. Polymerase Chain Reaction products were 
generated and cloned, expressed, and tested via Western blot. Constructs 
were made in this way both from the 5' end and the 3* end and the intervals of 
the number of amino acids were reduced upon each round. This resulted in 
the last round representing a one amino acid difference from the previous 
round (in both directions), such that one could deduce the exact binding site 
of the MCA 44-3A6. This demonstrates that at least these six amino acids 
are exposed to the external cell surface. To further prove the point, the DNA 
encoding only these six amino acids have been cloned and the fusion protein 
is positive by Western blot analysis. Synthetically prepared "P" peptide can 
be specifically detected by MCA 44-3A6, and the synthetic peptide was 
immunogenic in 5 of 5 mice tested. Computer analysis/ modelling also 
predicted that this epitope would be very immunogenic using computer 
assisted analysis (GCG programs) (Genetics Computer Group, Madison, Wl 
53703). 

Vaccine Preparation: A vaccine is a preparation of antigen(s), which 
when given to a host, results in the host producing antibodies against the 
antigen(s). The host response results in the host being immune to the 
disease to which the vaccine was directed. Vaccine treatment therefore, 
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prevents the clinical presentation of a disease, without the host being exposed to the 
disease causing agents. Lab has all the characteristics of a preferred cancer vaccine. 
The lab gene is frequently expressed by tumors which look like adenocarcinomas, is 
expressed an the outside of the cells, is expressed by all of the cells within a given 
5 cancer, is expressed at all times by these cancer cells, and is infrequently expressed by 
normal cells. Lab protein (peptides) can be produced by any number of methods 
using molecular cloning techniques, and can be produced in large quantities, thus 
making it a practical antigen to use as a vaccine. After the Lab protein has been 
purified so that it is suitable for injection into humans, it is administered to individuals 
1 0 intradermal^ , subcutaneously, or by other routes, so as to challenge the immune 
system to produce antibodies against this protein (peptides). 

The use of molecular modeling and computer assisted analysis GCG programs 
(Genetics Crystal Group, Madison, WI 53703) allows the identification of small 
portions of a molecule, slightly larger than an epitope (six to seven amino acids for 
15 proteins), wmch are expected to be on the surface of a protein molecule. Inaddition, 
the degree of hydrophobicity or hydrophilicity of a given sequence, and how 
immunogenic the sequence would be in animals, can be determined (Genetics Crystal 
Group, Madison, WI 53703). After defining which sequences meet these criteria, the 
peptides are synthetically made, or produced by a number of standard methods. One 
20 or more of these peptides can then be formulated to be used as a vaccine, and 
administered to the host as outlined above, as a vaccine. 

A vaccine comprising a molecule having an amino acid sequence selected 
from the group of sequences encoded by the cDNA of FIG. 1, sequences of FIG. 2, 
encoded by the cDNA, the peptides APPEDNPVED, EEQQEVPPDT, 
25 DGPTGEPQQE, and EQENPDSSEPV, and any fragments or combinations thereof. 

A given vaccine may be administered once to a host, or may be administered 
many times. In order for some patients to recognize a given 
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vaccine, an acjuvant may also need to be administered with the peptides. Adjuvants 
are nonspecific immune stimulators which heighten the immune readiness and aid in 
the conversion of the host from not having detectable serum antibodies to having very 
high titer serum antibodies. It is this high level (titer) of antibodies, which effectively 
5 protects the host from the diseases or conditions to which the antibodies are directed, 
Functional Studies : Studies directed at understanding the cellular functions) 
of Lab are extensions of cell localization/characterization studies (Siddique, et al., 
1992). Changes in levels of Lab in response to extracellular exposure to various 
response to extracellular exposure to various cations (Ca++, Mg++, Cu++, and Fe++) 
10 were undertaken. Lab expression in A549 cells was only modulated by Ca++. Using 
the highly specific fluorescent Fura-2/AM Ca++ method of measuring cytosolic 
Ca++, (Molecular Probes Inc., Eugene, OR 97402) it was demonstrated that; 1) the 
internal Ca++ concentration is higher in A549 cells than in QU-DB cells, and 2) that 
the A549 cell line responds to various external Ca++ levels (Siddique, et al., 1992). 
1 5 Since pH can modulate intracellular free Ca++ levels, external pH manipulations 

should result in changes in the expression levels of Lab. Extracellular pH changes (in 
the presence of normal Ca++ concentrations) result in 1) a parallel change in 
intracellular pH as measured by SNARF-1 AM/FACS, (Molecular Probes Inc., 
Eugene, OR 97402) 2) transcript levels increase for Lab (when compared to GAPDH 
20 expression via Northern blot); and that 3) Lab protein also increases (using 

Western/Slot blot analysis). The intracellular changes in pH (due to external changes) 
for A549 cells are Identical to those reported for normal cells. The increased 
expression of lab is also not due to cell death (as measured by MTT assays) 
(Siddique, et al., 1992). In addition, incubation of recombinant Lab at various pH 
25 solutions does not alter immunoreactivity. Preliminary data suggests that when these 
experiments are conducted on A549 cells grown in reduced Ca++, the induced 
expression of lab is blunted. 

Methods of Diagnosing Cancer Cells in a Sample of Cells: Biological samples 

from a subject are used to determine whether cancer cells are 



30 



99/47683 PC17US99/05365 

20 

present in the subject. Examples of suitable samples include blood and 
biopsy material. One method of diagnosis is to expose DNA from cells in the 
sample to a labeled probe that is capable of hybridizing to the lab gene, or a 
fragment thereof, under stringent conditions, e.g. 6x ssc; 0.05x blotto; 50% 
formamide; 42°C (Sambrook, era/., 1990). Of course, the hybridizing 
conditions are altered to achieve optimum sensitivity and specificity 
depending on the nature of the biological sample, type of cancer, method of 
probe preparation, and method of tissue preparation. 

After contacting the sample with the probe, the next step is 
determining whether the probe has hybridized with nucleotide sequences of 
the DNA from the sample, from which the presence of the lab gene is 
inferred, said presence being diagnostic of cancer. 

Another diagnostic method is to obtain monoclonal antibodies 
preferably labeled, either antibodies already existing, or new ones directed to 
the antigenic peptides that are aspects of the present invention, and contact 
a sample with these to detect the Lab antigen. These monoclonal antibodies 
are useful in the development of very specific assays for the detection of Lab 
antigen, and allow the tests to be carried out in many different formats; 
resulting in a broader application in science and medicine. 

The current invention is useful in that it describes a new gene which is 
expressed on the surface of tumors, which was not previously reported. This 
gene is not tissue specific, and therefore will allow the detection of tumors 
regardless of the organ in which they arise. Likewise, the use of thisgene to 
produce a vaccine for these tumors, will have a very broad application. 
Diagnostic tests will also have this broad tissue use, making the detection of 
Lab//ab a "pan-marker" for cancer, in particular for what have been 
designated previously, adenocarcinomas. 
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CLAIMS: 

1 . An isolated cDNA molecule with a nucleotide sequence as 
shown in FIG. 1. 

2. An isolated cDNA molecule with a nucleotide sequence showing 
5 about 70% homology to the nucleotide sequence in FIG. 1. 

3. A segment of the isolated cDNA molecule of claim 1 , wherein 
the segment extends from the start codon (ATG) to the stop codon TAA in the 
nucleotide sequence and includes 765 base pairs. 

4. A segment of the isolated cDNA molecule of claim 2, wherein 
1 0 the segment extends from the start codon (ATG) to the stop codon TAA, in 

the nucleotide sequence and includes about 765 base pairs. 

5. A segment of the isolated cDNA sufficient to encode an epitope 
in the Lab protein detectable by the antibody MCA 44-3A6. 

6. An amino acid sequence encoded by the cDNA segment of 
15 claim 1. 

7. An amino acid sequence encoded by the DNA segment of 
claim 2. 

8. A method to diagnose cancer cells in a sample of cells, 
said method comprising: 

20 a. contacting the sample of cells with a labeled probe 

that is capable of hybridizing to the lab gene, or a fragment thereof, 
under stringent conditions; and 

b. determining whether the probe has hybridized with 
nucleotide sequences in the sample, from which the presence of the 

25 lab gene is inferred, said presence being diagnostic of cancer. 
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9. A molecule with an amino acid sequence selected from the group consisting of 
APPEDNPVED, EEQQEVPPDT, DGPTGEPQQE, and EQENPDSSEPV. 

10. A molecule with the amino acid sequence PTGEPQ. 

5 

11. A peptide selected from the group consisting of all sequences that are between 3 
and 20 amino acids in length, aligned as are the amino acids in FIG. 2, wherein 
the position of the first amino acid of the sequence of n amino acids is selected 
from the group consisting of position 1 to n-3 to n-20. 

10 

12. An antibody directed to a peptide having an amino acid sequence selected from 
the sequence of claim 6, 8, 9, 10 or 1 1. 

13. The antibody of claim 12, further defined as a monoclonal antibody. 

15 

14. An antibody produced in mammals against an amino acid sequence encoded by 
the DNA molecule of claim 2, 3, 4, or 5 or a fragment thereof containing an 
epitope. 

20 15. A method for detecting the cDNA molecule of claim 2, 3, 4 or 5 in a sample of 
cells, said method comprising: 

(a) contacting the sample of cells with a labeled probe that is 
capable of hybridizing to the cDNA or a fragment thereof, under stringent 
conditions; 

25 (b) deterrnining whether the probe has hybridized to the 

nucleotide sequences in the sample; and 

(c) inferring the presence of the cDNA if the probe has 

hybridized. 



30 16. A method to attenuate the effects of expression of the cDNA molecule of claim 
2, 3, 4 or 5, said method comprising: 



1 , I : 

WO 99/47683 PCT7US99/05365 

27 



(a) obtaining an antisense molecule to the cDNA molecule or 
its expression product and 

(b) hybridizing the antisense molecule to the cDNA molecule 

or its expression product 

5 

17. The method of claim 16, wherein the antisense molecule is at least 100 
nucleotides in length. 

18. An isolated genomic DNA molecule from which the cDNA of claim 2 or 3 is 
10 derived. 

19. A vector comprising the isolated cDNA molecule of claim 2, 3, 4 or 5. 

20. A vector comprising a cDNA molecule that encodes a molecule with an amino 
15 acid sequence in accord with claims 6, 8, 9, 10 or 11. 
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CGGGAGCTTGAAGOACACAAGAA.TGGaAGGAAAGGCGGACTCTCAGGAAC 
T^raTTrTTgAgQTQgTTTATQGTOATTGCATTO 

CTGTAQrTG TCQTTTGGTT^^ 

AAACTAGQAATCTATGATGCTY^^^ 

TGegAAAGr rrTATTAGQACTT^ 

TggCGCCAGA AGAGGOTGAGCCACAC^CTGAGCCCGAGGAGCAGGTTCCT 

GTGGAGGCAGAACCCCAGA ATATCGAAGATGAAGCAAAAGAACAAATTC^ 

GTgCCTTCTrCCATGAAATGG TACACGCAGAACATGTTGAGGGAGAAGACr 

TGCAACAAGA AGATGGACCCACAGGAGAACCACAACAAGAGGATGATQAG 

TTTCTTATGG CGACTGATGTAGATGATAGATTTGAGACCCTGGAACTTGA 

AQTATCTCATGAAGAAACC GAGCATAGTTACCACGTGGAAQAGACAGTTT 

CACAAGACTGTAATCAGGA TATCGAAGAGATGATGTCTGAGCAGGAAAAT 

CGAGATTCCA GTGAAGCAGTAGTAGAAGATGAAAGATTGCACCATGATAC 

AGATGATGTAACATACCAA GTCTATGAGGAACAAGCAGTATATGAACCTC 

TAGAAAATGAAGGGATAGA^T^AgAgAA?T^CT9TCCCCCCT9AS9AT 

AATCCTTOTAGAAGATTCAC AGGTAATTGTAGAAGAAGTAAGCA'i'TriTCC 

TGTGGAAGA ^^AGPAGGAAGTACCACCAGATACTTAAAGCTTCAAAAAGA 

CTGCCCCTACCACCACAGGAGGACCAGCCTAACCATACGCTCCAAAAGAT 

GGCTGTGATAGATCTTGTGAAGCAATTACTGAGC^ 

GAAGGAACACTAAAGATGTTTTGAATGAATTATAGTCCACTGGCATTTTA 

CTOTA ITITITITI ' C ^ ^^ 

ACATTCCTGCATGTCCClTTTGATAGCATr^ 

ITITI ' CITIT TGTGAGACAGCTTTTAGTCTTACCTGAA 

TTTCCGAC^GTGGTTAATAATTATATTGGTGATGTAGCAGCAATTGTGTT 

GGCAGGGTTTTCATATATTATTAGTAATTAACACTAAC1X51TGGACTGAC 

T1GTGTACACTGTGTTAAACATGATTTAAAAGCTATTAAGAGTACTTTGT 

GTTAGGACTCTTAAAAACGCrrAACAGAGATCATCATTAGCT 

TGAGTTCTATATACCTGCACTGATATTCTTATCAAAAATTTCTACATTAG 

CTTTAAGTGTTCAGATTAACACTTTTGAAACCT^ 

TTAATTAGAAAAATTAATATTTCAGTGAAAGTTT^ 

TATTTTTTT AAATGAGAGGGGAAAG CTGAAATTCCTTGTTAAGACACAAG 

GAAAAAGAATGGCCCTACTATTATCATGCAAAAATGCTTTGTTGGCACCT 

CAGATTAATCATATAATAGCTATAGTCTCTTC^ 

AGAAAACCTGTATAAATTACTGGTGCATAACTTAAAGATTATTCTGCCTT 
TGGCTAATTGAGTAATTCCCCTCCAGCACTAGAGACCGCTCAGTGCTCTT 
ACTAGATGAACTCAGTAACGCCTTGAGCTGGGTTGATTGAGGATGTGTGA 
AAAAGCTCAC AG AGC CCGATGCCTG CTGCTATTTCACGGCAATGAGCCTT 
TTTCTTTCTACACTGAAGATTTTCTTCTTATT^ 

GCTCAGAAATAATTGCTCTGTTGAAAATAATCCTTTGTCAGAAAAGAAGG 

TAGCTACCACATCATTTTGAAAGGACCATGAGCAACTATAAGCAAAGCCA 

TAAGAAGTGGTTTGATCGATATATTAGGGGTAGCTCTTGATTTTGTTAAC 

ATTAAGATAAGGTGACTTTTTCCCCCTGCTTTTAGGATTAAAATCAAAGA 

TACTTCTATATTTTTATCACTATAGATC^TAGTTATTATACAATGTAGTG 

AGTCCTGCATGGGTACTCGATGTGTAATGAAACCTGAAATAATAATAAGA 

TAATAAGAAAAGCAATAATTTTCTAAAGCTGTGCTGTCGGTGATACAGAG 

ATGATACTCAAATTATAATAAAACTCTTCATTTTGTGAATTATAGAAGCT 

ACTITTTATAAAGCCATATTTTTTTAGGGAAACTAAGGAGTGACATAGAA 

CTGATGAATGAGTAAAAGTAAGTTTTGCTGGATTTTTGTAGAACTCTGGA 

CGTTGAGGATTC^TTATGCTGTGGTTAACTTTAAATATTTTT 
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(57) Abstract 

A cDNA molecule that encodes a protein designated 
Labyrinthin (Lab) is isolated and its nucleotide sequence 
is determined. The protein, or peptides derived from 
the protein, are markers useful to define novel classes of 
cancers. Diagnostic assays for these cancers use antibodies 
to Lab or nucleotide probes that hybridize with the lab 
gene or a fragment therefrom. Vaccines useful either to 
prevent recurrence of cancers in subjects who test positive 
for Lab (or lab), or to prevent initial occurrence of cancer, 
use proteins or peptides derived from Lab. Expression of 
Lab via immunogenic assays is used to monitor effects 
of cancer treatments. Antisense molecules against lab 
are used in treatments. Sense molecules of lab are used 
to restore lost lab function in diseased normal cells, for 
example, gland cells. 
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cmTAQL 1U I LU'1'l'likirrmATCTTOrnarnkT 
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TOCCAAAtn*TTTATTAfyift(Trr*Aaaftfian^TC. i'alm'i i^f tt fleeaqgaq 
TCCCQttaQhAaAOQCTa>noefcg&GBrTOaeott^ 

TOCAAC AJflAAg«m&o«acA^AqAftcca^ 

CftCftAQft CTXH'AATCAgaATATQQAAGAQ ATnRTffrtrrQAnr'&naa & »& t 



XAgftAAflTGMWgftTAqftAATCAC^OM 
AATCCmftQflAqftTTmnftQgTfc&TTgrAQAAn 

CTOCCL^TACC AC T^P > nn a f l f I> rran iX TMr!rATI>r i ffr Tn ' A AA AflAT 
OOCTOTOATAQ A1 L ' lUUIUA AOCAATTACTOAQCAaATCiAAQ AiCl T 1QU 
QAAQO AACACTAAAQATOTT TTOAATOAATTA^ 

O TOTATTT l I 1 1 1 Hi 1 m A<IAAACACACATTTCIAAAAATOTCATQTT 
ACATTCCT0CATO1 IXV1"I"1UUATAOCATTAOTOGATCCATTOGATTTCT 



TTTCOOACAOTOOTTAATAATT^^ 

OOCAOUU n'ri\ATATATTATTAOTAATTAACACTAACTqTT«3ACTt^ 
TTOTGTACAL 1 U IKi iTAAACATaAtRAAAAOCTATTAAaAOTACTTtOT 
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TOAOT T OTATAT ACCTOCACroATATTCTTATCAAAAATT^ 
CTTTAAOTOTTCAaATTAACACTTTTQAAACt 1 1 iU lAGCTTTTAQCTQA 
TT AATTAQAA AAATTAATATTTCAtnqAAAOTTTTAAATTATCATTTATT 
TATTTrrJTAAATOAQAOQOaAAAOCTQAA A TT C C I ' lWr AAQACACAAO 
aAAAAAOAATOaCCCTACTATTATCATOCXAAAATO Cl X ' ltfnW CACCT 
CAOATTAATCATATAATAOCTATAOTCTCTTCAQCA IUUU 1 A AATTTT 
AflAAAACCTOTATAAATTACTOOTOCATAACTTAAAQATTATTCTOCCTT 
TOTCTAATrgAOTAATTCCCCtCCAOCACtAOAQACOOCTCAO TOCT 
ACTAnATOAACTCWTAAaKXriTaAOCTOOCrr^ 
AAAAOCrCACAflACCCCaATOCCroCTOCTATTTCAOOOCAATQA^ 
1 ' £ _ ? I J CTACACT0A *Q Ai illLULl 1A TTTA A 1 U 1UUI I1A TTTTQO 
OCTCaaAMTAAT TOCTCTO rmAAAATA A T CCl 1 1U ILA flAAAAOAAOQ 
^A^TAOCACATCATTTTOAAAOOACCATQAOCAACDIXAAOCAAAOCCA 
TAAOAAO llWiriUA TOOftTA TAlTAOOOQTAOCTCTTQATTTTOTTAAC 
AT TAAOAT AAOgTOAL 11111 IVIAXTOCTTTTAOQATIAAAATCAAAQA 
TACTTCTATATTTTTATCACTATAaATCATAOTTATTATACAATCn'ACTro 
AOTCCTCCATOOOTACTOOATOTOTAATUfcAACCTOAAATAATAAT^ 
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ATOATACTCAAATTA T a a TA AAA C J L 11 UU 1 1 1 U iUA ATTATAQAAOCr 
ACT TO ATAAAC CC AT A1 1 11 i I 1A OWAAACTAAQOAOT0ACAIA0AA 
CTaATO AATqAOTAAA AQTA AUU 1 1UC1UUA1 1 L I iUlA QAACTCTOOA 
OOTTOAOQATTC A T TA T OC TO lOt/r TAACTTTAAAT AT TUT 
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GENE ENCODING LABYRINTHIN, A MARKER FOR CANCER 



The invention relates to a gene encoding a protein and peptides therefrom that 
includes an epitope, a cancer associated antigen, useful as a marker that is not 
restricted to previously defined histological classes of cancer. Antigenic peptides are 
useful as a vaccine for treatment and prevention of cancer, and for the preparation of 
new, specific, monoclonal antibodies. Antisense molecules are useful in 
pharmaceutical compositions and are useful for diagnosis and treatment. 

BACKGROUND OF THE INVENTION 

Cancer 1 is a leading cause of death in men and women throughout the world. 
In the United States alone, over I million new cases are diagnosed each year, and over 
0.5 million deaths are reported annually (Landis, et al y 1998). Historically, tumors 
are grouped and treated, based in part by the tissues in which they arise, e.g.- breast 
cancer, colon cancer, and lung cancer, and the like. Yet, within lung cancer, for 
example, it is well recognized that these tumors are a very heterogeneous group of 
neoplasms. This is also true for tumors arising in other tissues. In part, because of 
this heterogeneity, there are complex and inconsistent classification schemes which 
are used for human tumors. Previous attempts to treat cancer have been hampered by 
1) the arbitrary classification of tumors arising within given tissues, and 2) by using 
microscopic methods based on how these tumors look (histological classification). 
Although existing classifications for 



Terminology used herein is as follows, "cancer is a malignant tumor, wherein a 
"tumor" is an abnormal mass of tissue, that need not be malignant. "Neoplasm" 
is a form of new growth. 
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various tumor types have some prognostic value, almost all of the 
classifications fail to predict responsiveness to treatments and likelihood of 
cure or disease course. Improved classification schemes based on the 
biological constitution of these neoplasms is required to significantly alter the 
survival statistics of humans who have cancer. One approach to solving 
these problems is to locate molecules specific to tumors, preferably antigens 
in molecules that are markers for cancer cells. (A "marker" is defined herein 
as any property which can be used to distinguish cancer from normal tissues 
and from other disease states.) The markers' presence is then a basis for 
classification. 

Monoclonal antibodies (MCAs) prepared by somatic cell hybridization 
techniques, usually in mice, are useful molecular probes for the detection and 
discrimination of cellular antigens, and therefore have great potential for 
detecting cancer associated antigens. These antibodies bind to specific 
antigens and the binding is detectable by well known methods. When 
binding occurs, the inference is made that a specific antigen is present. 
Those cancer associated antigens which are exposed to the cell surface or 
found in the cancer mass, are molecular targets for the immune systems 
(including host antibodies) of the host. Recent findings suggest that cancer 
patients who have antibodies against their tumors, do better than those who 
do not mount this type of immune response (Livingston, et a/., 1994). 
Therefore, natural, induced, or administered antibodies are a promising 
therapeutic approach. 

The humanization of non-human MCAs (the process by which 
non-human MCA reactive sites are shuttled into cloned human antibodies 
and expressed) results in reduced immunogenicity of the foreign antibodies 
without the loss of their specific binding in in vivo and in ex vivo applications. 
MCAs can be used as in vivo imaging agents, diagnostic ttsta, end fer 
mtripy (faeaetvieh, at eh 1988, 1990! Resen, etal. 1988). 

Vaccine therapy is a well established approach directed at inducing an 
immune response without exposure to the causative agent of a disease or 
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condition. Many vaccines are available, for example, to stimulate a response 
in a host to bacterial and viral agents. The use of tumor associated antigens 
(markers) in a vaccine could prevent primary cancer occurrence, and could 
also provide a means to prevent recurrence of the disease. 
5 Gene therapy is a means by which the genetic make-up of cells is 

modified to express the gene of interest. There are many forms of gene 
therapy including: gene replacement, antisense suppression therapy, and 
surrogate gene expression. Discovering genes encoding cancer-associated, 
preferably cancer-specific antigens (markers) opens the door to genetic 

10 intervention against cancer cell proliferation. The accurate and consistent 

use of a cancer marker to differentiate cancerous from normal tissue, not only 
has diagnostic potential, but is also desirable for treatment and prognosis. 
Therefore, such markers have been sought. 

Recent studies have shown that the enzyme encoding human aspartyl 

15 beta-hydroxylase (HAAH) is overexpressed in some human adenocarcinoma 

cell lines, and in primary hepatocellular cancers, therefore could be a marker. 
The gene said to encode HAAH has been cloned and sequenced (Gronke, 
era/., 1989, 1990; Wang, ef a/., 1991; Jia, era/., 1992, 1994; Korioth, etal., 
1994; Lavaissiere, era/., 1996). However, little is known about HAAH 

20 expression in human tumors in general (Lavaissiere, ef a/., 1 996). 

The study of the HAAH enzyme grew out of the study of its bovine 
counterpart (Gronke, etal., 1989, 1990; Wang, etal., 1991; Jia, etal., 1992). 
Bovine aspartyl beta-hydrqxylase is an intracellular, glycosylated protein, 
localized in the rough endoplasmic reticulum. The protein has been reported 

25 to have three major species of molecules; a 85 kilodalton form, and two active 

forms with molecular weights of 56 and 52 kilodaltons respectively 
(Lavaissiere, etal., 1996). 

Using standard biochemical methods, bovine aspartyl 
beta-hydroxylase (bAAH) has been purified and characterized ( Gronke, et a/. 

30 1 990; Wang, ef al. , 1991). The activity of the enzyme has been shown to be 

correlated with the 52 and 56 kilodalton species which were purified. 
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Immunologically, a related higher molecular weight form (85-90 kilodalton) was also 
observed. As part of the purification, bAAH is bound to Con A sephaiose, which is 
consistent with the conclusion that the enzyme is glycosylated. (Subsequent reports 
on the DNA sequence show three possible glycosylation sites, with one site being 
5 very close to the known active enzyme domain.) The protein is very acidic in nature, 
and a detergent is not required to solubiHze the active fraction. The active enzyme 
site is dependent from the biochemically isolated bovine protein (bAAH) on the 
presence of histidine at position 675 (Jia, et al, 1994). 

A partial amino acid sequence was obtained for HAAH. DNA probes (a DNA 
10 probe is a molecule having a nucleotide sequence that is capable of binding to a 

specified nucleotide sequence under certain conditions) deduced from this amino acid 
sequence was used to screen a bovine cDNA library (Jia, et ai 9 1992). (A cDNA 
library contains the sections of DNA that encode for gene products, e.g. peptides as 
opposed to genomic DNA). Several overlapping cDNA sequences in the library 
1 5 contained a 764 amino acid open reading frame (ORF) sequence which will be 

expected to encode an 85 kilodalton protein. Also present in this ORF sequence were 
two other possible start codons, that is, locations at which encoding begins. The most 
3' start codon was preceded by a ribosome binding site. Translation of the clone 
having this sequence resulted in a protein that was about 85 kilodaltons. Antiserum 
20 was raised to the membrane fraction of human MG-63 calls and was used to 

immunoscreen a cDNA library made from MG-63 cells. Data on one clone was 
reported which could encode a 757 amino acid protein, and, by sequence analysis, was 
found to have strong N-tenninal homology with bAAH (Korioth, et ai 9 1994). When 
this clone was used in an in vitro translation system (an artificial cocktail of normal 
25 cell cytoplasm used to convert mRNA into protein), a 56 kilodalton protein was 
produced. It was suggested that this was due to posttranslational cleavage. 

The HAAH enzyme is responsible for the modification of specific aspartic 
acid residues within the epidermal growth factor-like domains of 
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proteins. It has been hypothesized that these modified aspartic acid residues 
allow the epidermal growth factor-like domains to become calcium binding 
domains. (Gronke, et a/., 1989, 1990; Wang, era/., 1991; Jia, et a/., 1992, 
1994; Korioth, et a/„ 1994; Lavaissiere, etal., 1996). 
5 An enzyme related to HAAH, aspartyl beta-hydroxylase (AAH), was 

first studied because it specifically modified select aspartic acid or 
asparagine residues in a group of biologically important proteins including the 
vitamin K-dependent coagulation factors VII, IX, and X. Other proteins like C, 
S, and Z also have this modification (Gronke, etal., 1989, 1990; Wang, etal., 

10 1991; Jia, etal., 1992, 1994; Korioth, era/., 1994; Lavaissiere, etal., 1996). 

Aspartic acid and asparagine residues have been shown to be modified by 
HAAH in proteins containing epidermal growth factor-like domains. The 
function of the beta-hydroxyaspartic and beta-hydroxyasparagine residues is 
unknown, however, it has been speculated that this modification is required 

1 5 for calcium binding in the epidermal growth factor EGF-like domains of 

selected proteins. 

Antibodies were raised to human hepatocellular carcinoma FOCUS 
cells (Lavaissiere, et a/., 1990). One MCA reacted with an antigen that was 
highly expressed in hepatocellular carcinomas (Lavaissiere, etal., 1996). 
20 Immunoscreening using this antibody and a lambda gt1 1 HepG2 library 

resulted in the isolation of a partial cDNA, which was subsequently used to 
isolate a larger clone. 

A human adenocarcinoma cell line designated A549 was reported as 
having very high levels of HAAH activity (Lavaissiere, et a/., 1996). A mouse 
monoclonal antibody designated MCA 44-3A6 (U.S. Patent No. 4,816,402) 
was produced against the human adenocarcinoma cell line A549 (ATCC 
accession number CCL 185) (Radosevich, et at., 1985). The antibody 
recognized a cell surface, non-glycosylated antigenic protein having an 
estimated apparent molecular weight of 40 kDa). 

The antigen was expressed by A549 cells, and was found to be a good 
adenocarcinoma marker; that is, it was frequently expressed by cancers 
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which looked like adenocarcinomas when examined histologically 
(Radosevich, ef a/., 1990a; Lee, etaL, 1985). MCA 44-3A6 is unique in that it 
is the first monoclonal antibody which has this binding specificity. The results 
from an International Workshop for Lung cancer confirmed other related 
5 published findings on MCA 44-3A6 (Stahel, 1 994). 

The antibody designated MCA 44-3A6 has clinical utility because it 
differentiates antigens associated with adenocarcinomas. The normal and 
fetal tissue distribution of the antigen is restricted to some glandular tissues 
(Radosevich, et a/., 1991). Detection can occur on formalin fixed-paraffin 
10 embedded tissue (Radosevich, ef a/., 1985, 1988, 1990a, 1990b; Lee, et a/., 

1985, 1986; Piehl, etaL 1988; Combs, ef a/., 1988b, 1988c; Banner, etal., 
1985). The antibody has a restricted binding pattern within human pulmonary 
tumors (Lee, etal., 1985; Banner, etal., 1985; Radosevich, etaL, 1990a, 
1990b). 

15 In a study of over two hundred pulmonary cancers, MCA 44-3A6 was 

found to react with all of the adenocarcinomas tested, many of the large cell 
carcinomas, as well as with subsets of intermediate neuroendocrine small cell 
lung cancers, well-differentiated neuroendocrine small cell carcinomas, 
carcinoids, but not mesotheliomas. MCA 44-3A6 does not react with 
20 squamous cell carcinoma, bronchioloalveolar carcinoma, or small cell 

carcinoma (Lee, ef a/., 1985). MCA 44-3A6 is useful in distinguishing 
adenocarcinomas that are metastatic to the pleura from mesothelioma (Lee, 
etal., 1986). The antibody has selected reactivity among adenocarcinomas 
and in large cell carcinomas (Piehl, ef a/., 1988; Radosevich, ef a/., 1990b). 
25 ,n a stud y of ov er 40 cases of lung cancer comparing cytological and 

histological findings, MCA 44-3A6 was found to be useful in cytological 
diagnosis and was consistent with the histological finding (Banner, etal., 
1985). Histology is the study of tissues (which are made of cells). Cytology 
Is tha itudy of ooili whlah have been removed from the organizational context 
which is commonly referred to as tissue. Cells removed from tissues do not 
always behave the same as if they were in the tissue from which they were 
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derived. Fortunately, the antigen detected by MCA 44-3A6 expressed in 
adenocarcinoma cells in tissue behaves in the same ways as 
adenocarcinoma cells removed from tissues. This is a very diagnostically 
important characteristic. Similar correlations using cytologically prepared cell 
5 blocks of pulmonary carcinomas, as well as ACs presenting in body fluids 

from other sites throughout the body were demonstrated (Lee, et a/., 1985; 
Spagnolo, etal., 1991; Combs, et a/., 1988c). Also, MCA 44-3A6 binds to 
adenocarcinomas from sites other than lung cancer. The expression of the 
antigen in primary and metastatic lesions was also reported ( Combs, et a/., 

1 0 1 988a). The utility of the MCA antibody in differentiating cancer from benign 

lesions in human breast tissue was also noted (Duda, etal., 1991). 

The cellular localization of the antigen detected by MCA 44-3A6 was 
determined. By using live cell radioimmunoassays (a radioactive antibody 
test directed at determining binding of the antibody to live cells), 

15 immunofluorescence, and live cell fluorescence activated cell sorter (FACS) 

analysis, the antigen detected by MCA 44-3A6, was shown to be on the 
outside surface of the cell (Radosevich, et a/., 1985). Additional studies 
using immunogold-electron microscopy and FACS analysis have 
demonstrated that this antigen is non-modulated (that is not internalized by 

20 the cancer cell when bound by an antibody), is expressed on the extracellular 

surface of the plasma membrane, and is not cell cycle specific that is, the cell 
makes protein all the time it is going through the process of cell replication, 
and also when it is not dividing (Radosevich, et al., 1 991 ). The antigen is not 
found in the serum of normal or tumor bearing patients, and is not shed into 

25 the culture media by positive cell lines (that is, cancer cells are known to bleb 

off portions of their cell membranes and release them into the surrounding 
fluid.) (Radosevich, etal., 1985). Recently 3 of 27 randomly tested 
adenocarcinoma patients were found to have naturally occurring antibodies 
to the antigen. In addition, radiolabeled MCA 44-3A6 was used to localize 

30 A549 tumors growing in nude mice. A douxorubicin immunoconjugate 

MCA 44-3A6 is selectively toxic in vitro (Sinkule, et a/., 1991). 
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Determination of the nucleotide and amino acid sequences of the antigen 
detected by MCA 44-3A6 would enhance the usefulness of this antigen in cancer 
diagnosis, treatment and prevention. 



5 BRIEF SUMMARY OF THE INVENTION 

The antigen detected by the antibody MCA 44-3 A6 as described in the 
Background is now designated as "Labyrinthin." A gene (designated labyrinthin; 
abbreviated lab) characterized by a unique nucleotide sequence that encodes the 
antigen detected by MCA 44-3A6 was isolated and characterized, (lab notation 
1 0 signifies the nucleic DNA/RNA forms; "Lab" notation refers to the protein which is 
encoded by the lab DNA/RNA). 

The invention described herein used the antibody MCA 44-3 A6 as a tool to 
clone the gene encoding Lab. In addition, an epitope (the necessary binding site for 
an antibody found on the antigen) for MCA 44-3A6 was identified on the Lab protein 
1 5 expressed by the clone to be PTGEPQ. 2 The epitope represents an important 

immunodominent sequence; that is, when injected into animals, the animals readily 
produce antibodies to this sequence. 

An aspect of the invention is the use of lab DNA in the sense 3 expression 
mode for. 1) the marking of human tumors by nucleotide probes; 2) the detection of 
20 DNA and mRNA expression of lab in cells and tissues; 3) the transformation of cells 
into a glandular-like cell type; 4) the production of Lab antigen in vivo for 
immunization; 5) the ex vivo expression of Lab for immunization to produce 
antibodies; and 6) production of Lab in vitro. Use of an antisense molecule, e.g. by 
production of a mRNA or DNA strand in the reverse orientation to a sense molecule, 
25 to suppress the growth of labyrinthin-aqtTessmg (cancerous) cells is another aspect of 
the invention. 



Standard abbreviations for amino acids. 

The normal transcription of a DNA sequence which proceeds from the 3' to the 
5' end to produce a mRNA strand from the sense strand of DNA, the mRNA 
being complementary to the DNA. 
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An aspect of the invention is a vector comprising a DNA molecule with a 
nucleotide sequence encoding at least an epitope of the Lab antigen, and suitable 
regulatory sequences to allow expression in a host cell. 

Another aspect of the invention is an amino acid sequence deduced from the 
5 protein coding region of the lab gene. Selected regions of the sequence were found 
via immunological methods, to produce effects corresponding to effects from both 
naturally occurring (from cancer cells), chemically produced (synthetically produced 
peptides), and expression products of the cloned lab gene. 

Another aspect of the invention is the use of the entire deduced amino acid 
10 sequence of Lab, peptides derived from Lab, or chemically produced (synthetic) Lab 
peptides, or any combination of these molecules, for use in the preparation of vaccines 
to prevent human cancers and/or to treat humans with cancer. For purposes of the 
present invention, "humans with cancer" are those persons who have the Lab antigen 
detected in their cells. These preparations may also be used to prevent patients from 
1 5 ever having these tumors prior to their first occurrence. 

Monoclonal antibodies directed to the Lab protein, or antigenic components or 
derivatives of Lab proteins, are useful for detection of Lab and for other purposes. 
Monoclonal antibodies which are made in species other than those which react with 
the Lab antigen can be modified by a number of molecular cloning methods such that 
20 they retain their binding with the Labyrinthin peptides, yet are not immunogenic in 
humans (Sastry, et al, 1989; Sambrook, et al, 1990). In brief, this is done by 
replacing the binding site sequence of a cloned human antibody gene, with the 
binding site sequence of the non-human monoclonal antibody of interest. These 
"humanized" MCAs are used as therapeutic and diagnostic reagents, in vivo, ex vivo, 
25 and in vitro. 

The use of the Lab protein or antigenic peptides derived therefrom in 
diagnostic assays for cancer is a way to monitor patients for the presence and amount 
of antibody that they have in their blood or other body fluids or tissue. This detection 
is not limited to cancers of a class or classes previously defined, but is useful for 
30 cancer cells that have the Lab marker 
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antigen. The degree of seroconversion, as measured by techniques known to those of 
skill in the art [e.g., ELISA (Engrail and Perlmann, 1971)] may be used to monitor 
treatment effects. 

Treatment with antisense molecules to lab or antibodies to Lab in a 
5 pharmaceutical composition is an approach to treat patients who have Lab in, or on, 
their cancer cells. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is the nucleic acid sequence of the lab gene. 
10 FIG. 2 is the amino acid sequence for Lab, deduced from the lab gene. 

FIG. 3 is an illustration of the lab gene and how it is related to the HAAH 
enzyme. 

DETAILED DESCRIPTION OF THE INVENTION 
15 Molecular Biology of Labvrinthm : To demonstrate that the epitope MCA 

44-3A6 is encoded by a protein sequence, high molecular weight DNA from the cell 
line A549 was isolated. This DNA was co-precipitated (via calcium) with a plasmid 
(pSVneo), and used to transfect a mouse cell line designated B78H1 cells (Albino, et 
al, 1985). This mouse cell line is negative for the expression of the epitope and was 

20 reported to have a high frequency of incorporation and expression of any human DNA 
sequences. If a given B78H1 cell was in a state to take up DNA, it would be expected 
to have taken up both human DNA and the plasmid DNA. The plasmid DNA makes 
the cell resistant to G418 (a normally toxic drug). Therefore, if a cell normally 
sensitive to G41 8 growth inhibitor grows in G418, it had to have taken up the 

25 plasmid, and may also have taken up one or more A549 DNA sequences. After G41 8 
selection (a way of choosing only cells which have resistance to growth in G41 8 by 
the uptake/expression of the Neo gene on pSVneo plasmid, and therefore representing 
cells that were in a state to uptake other DNA at the same time), approximately 15 of 
1x10 s clones were detected using immunoselection with MCA 44-3 A6. This finding 

30 is consistent 
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with a conclusion that human A549 cells have DNA that encodes Lab and 
possesses the regulatory sequences necessary for the expression of Lab. 

Comparison of HAAH and Labvrinthin: Because the DNA sequence of 
lab was determined as an aspect of the present Invention, HAAH and lab 
5 could be compared. HAAH and the lab nucleotide sequences have some 

internal fragment similarities, but are different on either side of the fragment, 
and are related to different products. This conclusion is based in part by the 
analysis and homology of the DNA sequences reported for these two genes. 
Specifically, the lab 5' region has no homology with HAAH. The protein 

10 coding region of lab has about a 99.6% homology with an internal segment of 

the proposed protein coding region for HAAH. The 3' region has no 
homology with the HAAH reported sequence. Virtually all of the other data 
comparing HAAH and labyrinthin are different, for example: 1) molecular 
weights of the proteins, 2) cellular localization, 3) chromosome localization, 

15 4) histological presentation in normal tissues and tumors, 5) northern blot 

expression, 6) immunological findings. 

Although the protein coding region of lab is identical to an internal 
region of the sequence reported for HAAH, the 5' untranslated region of 
HAAH is different, and part of the 5' translated protein coding region of HAAH 

20 is missing from that found in the lab clone. From both HAAH and lab clones, 

the deduced protein would be expected to be very acidic in nature, and 
therefore would run anomolously in SDS gels. As predicted, the Lab protein 
migrates anomolously in SDS gels. What was cloned and disclosed in the 
present invention migrates identically to the native protein found in several 

25 cell lines. Convincing evidence that the correct gene fragment encoding the 

antigen detected by MCA 44-3A6 has been cloned (mRNA) is that when the 
recombinant protein is made, that recombinant protein should act (in this 
case - have an apparent molecular weight) the same as independent 
biologically derived source of that protein. Lab provided from clones has the 

30 characteristics of Lab from cells. 
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The deduced amino acid sequence encoded by HAAH requires the use of an 
open reading frame which would produce a protein that is 85-90 kilodaltons, and does 
not take into account mat there are several start codons and other shorter open reading 
frames. The deduced HAAH protein (biochemically) is glycosylated and the reported 
5 sequence has glycosylation sites ( Korioth, et aL, 1994; Lavaisslere, et ai, 1996). To 
the contrary, Lab is not glycosylated, nor does it have predicted glycosylated sites. 

The deduced HAAH amino acid sequence contains a region shared by the Lab 
amino acid sequence which is predicted to be very hydrophobic. Lab requires strong 
detergents in order to be soluble; HAAH does not. The increased expression of 
10 HAAH (by enzyme activity measurements) in the same cell line (A549) which was 
used to clone and study lab extensively, suggests that both of these gene products may 
be important to the AC phenotype and that at least A549 cells make both functional 
HAAH and Lab. Successful transfections of the antisense to lab into A549 resulted in 
a marked decrease in expression of lab and in the growth rate of the cells. The 
1 5 expression of a sense lab construct in NIH-3T3 cells (normal mouse fibroblasts) 
resulted in a marked change in phenotype, a phenotype consistent with that of ACs. 
Therefore, lab expression is associated with conversion of normal cells to cancerous 
cells. Lab and HAAH have potential calcium binding domains in common. 

cDNA Library Contraction and Cloning : A cDNA lambda gtl 1 phage library 
20 was constructed using mRNA which was isolated from actively growing A549 cells 
(Sambrook, et ai, 1990). This oligo(dT)-primed cDNA was cloned into the Eco RI 
site using Eco RI linkers. The library has about 83% clear (containing an insert) 
plaques with a titer of 1.2 x 10 lo /ml representing a minimum of 1.46 x 10 6 
independent plaques which, by Polymerase Chain Reaction, have insert sizes ranging 
25 from 0.6 to 5 kilobases. Since Lab is a 40 kilodalton integral protein, (a protein which 
is embedded in the plasma membrane) the theoretical full length mRNA encoding this 
protein, including a potential leader sequence is estimated to be about 1.1 kilobases. 
This library was 
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immunoscreened using the antibody MCA 44-3A6, Eight independently derived 
phage stocks (identical phage which are from the same plaque) were isolated. These 
have all been plaque purified by repeated cycles of immunoscreening/isolation. Upon 
Eco RI digestion of these eight isolates, inserts of about 2kb were seen. The largest 
5 insert was isolated (2A1A1) and the Eco RI fragment was cloned into the pGEM-3Z 
plasmid. 

Sequencing and Sequence Analysis : The DNA fragment designated 2 A1A1 
was found to have an insert of 2442 base pairs in length (FIG. 1), containing a 5* 
untranslated region, a ribosome binding site, and a start codon which would be 

10 expected to encode a 255 amino acid protein (FIG. 2). The 3' untranslated region is 
. remarkable in that it contains only four instability sequences; ATTTA (Xu, et al, 
1997). In addition there are sequences found in the very 3' end of mRNA's which 
result in adenylation of the mKNA (Sambrook, et ai, 1990). The lab sequence 
contains both a sub-optimal (ATTAAA) and optimal (AATAAA) poly-adenylation 

1 5 site. These are sequences found in the very 3* end of mKNA's which result in 

adenylation of the mRNA. This finding provides molecular data which supports the 
cellular and biochemical data that has been outlined herein. (The HAAH clone has a 
poly A signal, but the whole 3* region has not been sequenced.) 

A calcium binding site motif is noted in the Lab amino and sequence (FIG. 2), 

20 however, it is out of the known required structural context to be a binding site. In this 
case, the calcium limiting sequence is there, but it is not in a protein sequence context 
that is known to make it work as a binding site. Homology was noted with lab and an 
EST clone (designated #05501) which represented only a portion of the 3* 
untranslated region and independently confirmed this portion of the sequence. Some 

25 internal fragment homology is also noted with HAAH, but the 5' untranslated and part 
of the 5' translated region is different (58 amino acids), as well as a major portion of 
the 3* coding region is missing in lab (FIG. 3). 

Genomic DNA Cloning and Analysis : Using a PCR fragment representing the 
protein coding region of lab as a probe, a genomic lambda 
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FIX II library made from the human pulmonary fibroblast cell line WI-38 was 
screened. Ten primary plaques were isolated out of approximately 1x10 6 
screened plaques. Using seven of these as target DNA, Polymerase Chain 
Reaction conditions were established with primers for the protein coding 

5 region, producing a 765 base pair fragment, the expected protein coding 

region for lab. On Northern blots (a method used to qualitatively assess 
mRNA) lab only detects one band noted at 2.7 kilobases. The recombinant 
protein made from the lab clone, when tested on Western blots (a method 
used to qualitatively define proteins) using MCA 44-3A6, has the same 

10 relative mobility as the Lab protein when made by A549 cells. 

Lab and HAAH genes give different results in the proteins they 
encode. HAAH consistently gives two bands on Northern blot analysis (2.6 
and 4.3 kilobases) suggesting that the 2.6 kilobase band is due to alternative 
splicing, i.e. the cell cuts and splices the mRNA. Also, if lab and HAAH are 

15 the same gene, HAAH should be detected in all tissues and cancer cell lines 

in which Lab is found. However, Lab is not seen on Northern blots of cell 
lines EMT6 or QU-DB, nor is there immunoreactivity in these cells; indicating 
that Lab mRNA is not made, and that Lab protein is not produced in these 
cells. Lab protein is rarely expressed in normal cells, where both the HAAH 

20 mRNA and HAAH protein have been reported to be expressed in almost 

every tissue studied. 

mRNA Analysis: Northern blot analysis of the DNA fragment from the 
A549 cell line using Jab cDNA as a probe identified a single band of about 2.7 
kilobases. This is expected based on the cDNA (2442 base pairs) and a 

25 poly-A tail of about 300 base pairs. Northern blot analysis of the mouse cell 

line, EMT6, and of the human large cell carcinoma cell line, QU-DB, confirm 
that no transcript for lab is produced by these cells. This is consistent with 
immunoassays which are negative for Jab expression on these cells. 

Antlsense and Sense cDNA Expression: The plasmld (pBK-CMV) 

30 (Sambrook, et a/., 1 990) may carry either the sense or antisense full length 

cDNA lab into A549 and NIH 3T3 ceils. An antisense molecule can be, for 
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example, a complementary sequence to a sense molecule that hybridizes 
with the sense molecule, preventing its expression. Using the MTT assay 
(Siddique, et a/., 1 992) to assess the growth rate of A549 cells expressing 
antisense to lab, a marked reduction in growth rate was noted. The antisense 
5 transfected A549 cells appear to have a greater degree of contact inhibition. 

A detectable amount of Lab is reduced in these antisense transfected cells. 
NIH-3T3 cells convert from a fibroblast-like cell type morphology (large, thin 
spindle shaped) to a large, adenocarcinoma appearing cells (very round, 
plump) when sense expression occurs. 

1 0 Chromosome Localization: The chromosome localization for lab, using 

full length cDNA as a probe via in situ hybridization (Sambrook, et al. 1990) is 
tentatively on chromosome 2q12-14, with possibly some reactivity to 
chromosomes 4 and 8. Using the same probe (the full length cDNA 
sequence of lab) and FACS sorted chromosomes (Lebo, etal. 1985) staining 

1 5 was also noted on chromosome 2, with weak staining on 4 and none on 8. 

The use of genomic clones will be of particular value in resolving these data 
because higher stringency hybridization conditions than that allowable for the 
cDNA, can be used, thereby reducing background signals. This is yet 
another proof that the correct gene was cloned and that the results are not 

20 due to a method artifact. There may be mutations in the genomic DNA of 

tumors and for the present invention, DNA was cloned from tumor cells 
(A549). Therefore, a mutated gene could have been cloned. However, that 
is not the case because the genomic DNA from a normal cell (DNA) produced 
the same sequence as what cloned as described herein. Therefore, a normal 

25 gene was cloned from A549 cells. The weak signals on chromosomes 4 and 

8 are consistent with a pseudogene or a related gene. For example, HAAH 
has been reported to be on chromosome 8q12 by in situ hybridization, so this 
result on chromosome 8 could reflect the HAAH and lab sequence homology. 
Prataln Molaaular fih«raBtan«tlon of Labvrtnthln: Previous work using 

30 Western blot analysis (a qualitative assay to assess antigens) has shown that 

the Lab antigen is a 40 kilodalton (by relative mobility) protein detectable in 
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A549 cells (Radosevich, et ai, 1985). The epitope does not appear to be modulated 
or blocked by lectins, and is selectively expressed on the cell surface, primarily 
localized to the plasma membrane. (Radosevich, et al, 1985, 1991). Lab is sensitive 
to proteases, but not lipid or carbohydrate altering reactions (Radosevich, et al, 

5 1985). The biochemical properties of Lab are consistent with Lab being an integral 
membrane protein. 

Having a deduced amino acid sequence from the lab gene of the present 
invention, allows further characterization of the Lab protein. Extensive computer 
analysis of Lab has identified a eukaryotic leader-like sequence and theoretical 

10 cleavage site, 3 myristylation sequence sites, a weak membrane anchoring domain 
(MAD I), and a strong membrane anchoring domain (MAD II) (FIG. 2). [(In the 
HAAH sequence, there are 58 (theoretical) amino acids followed by a sequence 
homology in the Lab protein coding sequence and an additional 445 amino acid 3' to 
the lab sequence.)] 

1 5 When Lab is expressed as a fusion protein in a bacterial GST fusion 

expression system (pGEMEX-2T) (Amereham Pharmacia Biotech, Inc., Piscataway, 
New Jersey, 08854, USA), and subjected to Western blot analysis using the antibody 
MCA 44-3 A6, the resulting blots demonstrate that the expressed cleaved fusion 
protein has the same relative mobility as the protein detected in A549 ceils. The 
20 deduced molecular weight for Lab is 28 .8 kilodaltons and on Western blots it has a 
relative mobility identical to the form expressed by A549 cells (apparent relative 
mobility = 40 kilodaltons). The 55 glutamic and 27 aspartic acid residues a (82 
residues combined) are almost uniformly distributed throughout the protein (255 
amino acids total; 228 no leader sequence), except for the leader sequence and the 
25 strongest membrane anchoring domain (MAD II). These data suggest that Lab 

migrates anomelously in SDS gels. Cell lines other than A549 (e.g. adenocarcinomas 
DU-145, ATCC # HTB-81; ZR-75-1, ATCC # CRL-1504, and so forth) have an 
antigen detected with the same molecular weight antigen as Lab. Neither a 85-90 
kilodalton molecular weight species, nor a 
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52 and 56 kilodalton molecular weight species is noted when probing 
Western blots for Lab. 

Epitope Mapping Using the Antibody MCA 44-3A6 and Vaccine 
Feasibility of Lab: Using Polymerase Chain Reaction and the GST fusion 
5 protein system, subclones of the protein coding region were made, and 

epitopes mapped the binding of MCA 44-3A6 to six amino acids (PTGEPQ) 
representing amino acids #1 17-122 of Lab ("P" peptide). In order to 
determine this epitope, the entire coding region was divided into regions, 
Polymerase Chain Reaction primers were designed to amplify each region, 

1 0 and the subsequent expression of Polymerase Chain Reaction products were 

cloned and tested by Western blot analysis using the antibody MCA 44-3A6. 

The DNA fragment representing the positive Western blot result was 
then further subdivided. Polymerase Chain Reaction products were 
generated and cloned, expressed, and tested via Western blot. Constructs 

1 5 were made in this way both from the 5' end and the 3' end and the intervals of 

the number of amino acids were reduced upon each round. This resulted in 
the last round representing a one amino acid difference from the previous 
round (in both directions), such that one could deduce the exact binding site 
of the MCA 44-3A6. This demonstrates that at least these six amino acids 

20 are exposed to the external cell surface. To further prove the point, the DNA 

encoding only these six amino acids have been cloned and the fusion protein 
is positive by Western blot analysis. Synthetically prepared "P" peptide can 
be specifically detected by MCA 44-3A6, and the synthetic peptide was 
immunogenic in 5 of 5 mice tested. Computer analysis/ modelling also 

25 predicted that this epitope would be very immunogenic using computer 

assisted analysis (GCG programs) (Genetics Computer Group, Madison, Wl 
53703). 

Vaccine Preparation: A vaccine is a preparation of antigen(s), which 
when given to a host, results in the host producing antibodies against the 
30 antigen(s). The host response results in the host being immune to the 

disease to which the vaccine was directed. Vaccine treatment therefore, 
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prevents the clinical presentation of a disease, without the host being exposed to the 
disease causing agents. Lab has all the characteristics of a preferred cancer vaccine. 
The lab gene is frequently expressed by tumors which look like adenocarcinomas, is 
expressed an the outside of the cells, is expressed by all of the cells within a given 
5 cancer, is expressed at all times by these cancer cells, and is infrequently expressed by 
normal cells. Lab protein (peptides) can be produced by any number of methods 
using molecular cloning techniques, and can be produced in large quantities, thus 
making it a practical antigen to use as a vaccine. After the Lab protein has been 
purified so that it is suitable for injection into humans, it is administered to individuals 
10 intradermally, subcutaneously, or by other routes, so as to challenge the immune 
system to produce antibodies against this protein (peptides). 

The use of molecular modeling and computer assisted analysis GCG programs 
(Genetics Crystal Group, Madison, WI 53703) allows the identification of small 
portions of a molecule, slightly larger than an epitope (six to seven amino acids for 
15 proteins), which are expected to be on the surface of a protein molecule. In addition, 
the degree of hydrophobicity or hydrophilicity of a given sequence, and how 
immunogenic the sequence would be in animals, can be determined (Genetics Crystal 
Group, Madison, WI 53703). After defining which sequences meet these criteria, the 
peptides are synthetically made, or produced by a number of standard methods. One 
20 or more of these peptides can then be formulated to be used as a vaccine, and 
administered to the host as outlined above, as a vaccine. 

A vaccine comprising a molecule having an amino acid sequence selected 
from the group of sequences encoded by the cDNA of FIG. 1, sequences of FIG. 2, 
encoded by the cDNA, the peptides APPEDNPVED, EEQQEVPPDT, 
25 DGPTGEPQQE, and EQENPDSSEPV, and any fragments or combinations thereof. 

A given vaccine may be administered once to a host, or may be administered 
many times. In order for some patients to recognize a given 
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vaccine, an acjuvant may also need to be administered with the peptides. Adjuvants 
are nonspecific immune stimulators which heighten the immune readiness and aid in 
the conversion of the host from not having detectable serum antibodies to having very 
high titer serum antibodies. It is this high level (titer) of antibodies, which effectively 

5 protects the host from the diseases or conditions to which the antibodies are directed, 
Functional Studies : Studies directed at understanding the cellular functions) 
of Lab are extensions of cell localization/characterization studies (Siddique, et aL, 
1992). Changes in levels of Lab in response to extracellular exposure to various 
response to extracellular exposure to various cations (Ca++, Mg++, Cu++, and Fe++) 

10 were undertaken. Lab expression in A549 cells was only modulated by Ca++. Using 
the highly specific fluorescent Fura-2/AM Ca++ method of measuring cytosolic 
Ca++, (Molecular Probes Inc., Eugene, OR 97402) it was demonstrated that; 1) the 
internal Ca++ concentration is higher in A549 cells than in QU-DB cells, and 2) that 
the A549 cell line responds to various external Ca++ levels (Siddique, et aL, 1992). 

1 5 Since pH can modulate intracellular free Ca++ levels, external pH manipulations 

should result in changes in the expression levels of Lab. Extracellular pH changes (in 
the presence of normal Ca++ concentrations) result in 1) a parallel change in 
intracellular pH as measured by SNARF-1 AM/FACS, (Molecular Probes Inc., 
Eugene, OR 97402) 2) transcript levels increase for Lab (when compared to GAPDH 

20 expression via Northern blot); and that 3) Lab protein also increases (using 

Western/Slot blot analysis). The intracellular changes in pH (due to external changes) 
for A549 cells are Identical to those reported for normal cells. The increased 
expression of lab is also not due to cell death (as measured by MTT assays) 
(Siddique, et aL, 1992). In addition, incubation of recombinant Lab at various pH 

25 solutions does not alter immunoreactivity. Preliminary data suggests that when these 
experiments are conducted on A549 cells grown in reduced Ca++, the induced 
expression of lab is blunted. 

Methods of Diagnosing Cancer Cells in a Sample of Cells : Biological samples 
from a subject are used to determine whether cancer cells are 

30 
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present in the subject. Examples of suitable samples include blood and 
biopsy material. One method of diagnosis is to expose DNA from cells in the 
sample to a labeled probe that is capable of hybridizing to the lab gene, or a 
fragment thereof, under stringent conditions, e.g. 6x ssc; 0.05x blotto; 50% 
5 formamide; 42°C (Sambrook, ef a/., 1 990). Of course, the hybridizing 

conditions are altered to achieve optimum sensitivity and specificity 
depending on the nature of the biological sample, type of cancer, method of 
probe preparation, and method of tissue preparation. 

After contacting the sample with the probe, the next step is 

1 0 determining whether the probe has hybridized with nucleotide sequences of 

the DNA from the sample, from which the presence of the fab gene is 
inferred, said presence being diagnostic of cancer. 

Another diagnostic method is to obtain monoclonal antibodies 
preferably labeled, either antibodies already existing, or new ones directed to 

15 the antigenic peptides that are aspects of the present invention, and contact 

a sample with these to detect the Lab antigen. These monoclonal antibodies 
are useful in the development of very specific assays for the detection of Lab 
antigen, and allow the tests to be carried out in many different formats; 
resulting in a broader application in science and medicine. 

20 The current invention is useful in that it describes a new gene which is 

expressed on the surface of tumors, which was not previously reported. This 
gene is not tissue specific, and therefore will allow the detection of tumors 
regardless of the organ in which they arise. Likewise, the use of this gene to 
produce a vaccine for these tumors, will have a very broad application. 

25 Diagnostic tests will also have this broad tissue use, making the detection of 

Lab/lab a "pan-marker for cancer, in particular for what have been 
designated previously, adenocarcinomas. 
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CLAIMS: 

1 . An isolated cDNA molecule with a nucleotide sequence as 
shown in FIG. 1. 

2. An isolated cDNA molecule with a nucleotide sequence showing 
5 about 70% homology to the nucleotide sequence in FIG. 1 . 

3. A segment of the isolated cDNA molecule of claim 1 , wherein 
the segment extends from the start codon (ATG) to the stop codon TAA in the 
nucleotide sequence and includes 765 base pairs. 

4. A segment of the isolated cDNA molecule of claim 2, wherein 
10 the segment extends from the start codon (ATG) to the stop codon TAA, in 

the nucleotide sequence and includes about 765 base pairs. 

5. A segment of the isolated cDNA sufficient to encode an epitope 
in the Lab protein detectable by the antibody MCA 44-3A6. 

6. An amino acid sequence encoded by the cDNA segment of 
15 claim 1. 

7. An amino acid sequence encoded by the DNA segment of 
claim 2. 

8. A method to diagnose cancer cells in a sample of cells, 
said method comprising: 

20 a- contacting the sample of cells with a labeled probe 

that is capable of hybridizing to the lab gene, or a fragment thereof, 
under stringent conditions; and 

b. determining whether the probe has hybridized with 
nucleotide sequences in the sample, from which the presence of. the 

25 lab gene is inferred, said presence being diagnostic of cancer. 
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9. A molecule with an amino acid sequence selected from the group consisting of 
APPEDNPVED, EEQQEVPPDT, DGPTGEPQQE, and EQENPDSSEPV. 

10. A molecule with the amino acid sequence PTGEPQ. 

11. A peptide selected from the group consisting of all sequences that are between 3 
and 20 amino acids in length, aligned as are the amino acids in FIG. 2, wherein 
the position of the first amino acid of the sequence of n amino acids is selected 
from the group consisting of position 1 to n-3 to n-20. 

12. An antibody directed to a peptide having an amino acid sequence selected from 
the sequence of claim 6, 8, 9, 10 or 1 1. 

13. The antibody of claim 12, further defined as a monoclonal antibody. 

14. An antibody produced in mammals against an amino acid sequence encoded by 
the DNA molecule of claim 2, 3, 4, or 5 or a fragment thereof containing an 
epitope. 



20 15. A method for detecting the cDNA molecule of claim 2, 3, 4 or 5 in a sample of 
cells, said method comprising: 

(a) contacting the sample of cells with a labeled probe that is 
capable of hybridizing to the cDNA or a fragment thereof, under stringent 
conditions; 

25 (b) determining whether the probe has hybridized to the 

nucleotide sequences in the sample; and 

(c) inferring the presence of the cDNA if the probe has 

hybridized. 



15 



30 16. A method to attenuate the effects of expression of the cDNA molecule of claim 
2, 3, 4 or 5, said method comprising: 
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(a) obtaining an antisense molecule to the cDNA molecule or 
its expression product and 

(b) hybridizing the antisense molecule to the cDNA molecule 
or its expression product 

5 

17. The method of claim 16, wherein the antisense molecule is at least 100 
nucleotides in length. 

1 8. An isolated genomic DNA molecule from which the cDNA of claim 2 or 3 is 
10 derived, 

19. A vector comprising the isolated cDNA molecule of claim 2, 3, 4 or 5. 

20. A vector comprising a cDNA molecule that encodes a molecule with an amino 
1 5 acid sequence in accord with claims 6, 8, 9, 1 0 or 1 1 . 
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CGGGAGCTTGAAGQACAC^UVGAATGGQAGGAAAGGCGGACTCT 

TTC^TTCTTCACGTGGTT TATCK3TGATTGCAT^ 

CTGTAGCTG TCGTTTGQTTT^^ 

AAACTAGQAATCTATOATO^ 

TGCCAAAGTTTTATTAGGACTTAAAGAGAGATCTACTTCAGAGCCAGCAG 

TCCCGCCAQAAGAGGCTGAGCC7VCACAOTGAGCCCGAGGAGCAGGTTCCT 

QTGGAGGCAGAACCCCAGAATATCGAAGATGAAGCAAAAGAACAAATTCA 

GTCCCTTCTCCATCAAATGGTACACGCAGAACATGTTGAGGGAGAAGACT 

TQCAACAAGAAGATGGACCCACAGGAGAACCACAACAAGAGGATGATGAG 

TTTC?TTATGG CGACTGATGTAGATGATAGATTTGAGACCCTGGAACTTGA 

AGTATCTCAT GAAGAAACCGAGCATAGTTACCACGTGGAAGAGACAGTTT 

CACAAGACTGTAATCAGGATATGGAAGAGATGATGTCTGAGCAGGAAAAT 

CCAGATTCCAQTGAACCAGTAGTAGAAGATCAAAGATTGCACCATGATAC 

AGATGATGTAACATACCAAGTCTATGAGGAACAAGCAGTATATGAACCTC 

TAGAAAATGAAGGGATAGAAATCACAGAAGTAACTGTCCCCCCTGAGGAT 

AATCCTGTAGAAGATTCACAGGTAATTGTAGAAGAAGTAAGCATTTTTCC 

TGTGGAAGAACAG CAGQAAGTACCACCAGATACTTAAAGCTTCAAAAAGA 

CTGCCCCTACCACCACAGGAGGACCAGCCTAACCATACG CTC CAAAAGAT 

GGCrGTGATAGATCTTGTGAAGCAATTACTGAGCAGAT 

GAAGGAACACTAAAGATGTTTTGAATGAATTATAGTCCACTGGCATTTTA 

GTGTATrTlUTI'l^'CI'TlTlAGAAACACACATTTCTrAAAAATGTC^TGOT 

ACATTCCTGCATGTCCCTTTTGATAGCATT 

TTTTTCTTTTTGTGAGAGAGCTCT 

TTTCCGACAGTGGTTAATAATTATATTGGT^ 

GGCAGGGTTTTCATATATTATTAGTAATTAACACTAACTGTTGG^ 

TTGTGTACACTGTGTTAAACATGATTTAAAAGCTATTAAGAGTACTTTGT 

GTTAGCACTCTTAAAAACGCTAACAGAGATCATCATC 

TGAGTTGTATATACCTGCACTGATATTCTI^ATCAAAAATTI'CTACATTAG 

CTTTAAGTGTTCAGATTAACACTTTTGAA^ 

TTAATTAGAAAAATTAATATTTCAGTGAAAGTTTTAAATTATCATOT 

TATTTTTTTAAATG AGAGGGGAAAG CrTGAAATTCCTTGTTAAG ACACAAG 

G AAAAAGAATGG CCCTACTATTATCATGCAAAAATG CTTTGTTGGCACCT 

CAGATTAATCATATAATAGCTATAGTCTCTTCAGCATTTC 

AGAAAACCTGTATAAATTACTGGTGCATAACTTAAAGATTATTCrGCCTO 

TGG CTAATTGAGTAATTCCCCTC CAGGACTAGAGACCGCT 

ACTAGATGAACTCAGTAACGCCTTGAGCTGGGTTGATTGAGGATGTGTGA 

AAAAGCTCAC^GAGCCCGATGCCTIX3CTGCrrATTTCACGG(^TGAGC^ 

TTTCTTTCTAC^CTGAAGATTTTCTTCTTATTTAATGTGGTTTAT^ 

GCTCAGAAATAATTGCTCTGTTGAAAATAATCCTTTGTCAGAAAAGAAGG 

TAG CTACCACATCATTTTGAAAGGACCATGAGCAACTATAAGCAAAG CCA 

TAAGAAGTGGTTTGATCGATATATTAGGGGTAGCTCTTGATTTTGTTAAC 

ATTAAGATAAGGTGACTTTTTCCCCCTGCTTTTAGGATTAAAATCAAAGA 

TACTTCTATATTTTTATCACTATAGATCATAGTTATTATACAATC 

AGTCCTGCATGGGTACTCGATGTGTAATGAAACCTGAAATAATAATAAGA 

TAATAAGAAAAGCAATAATTTTCTAAAGCTGTGCTGTCGGTGATACAGAG 

ATGATACTCAAATTATAATAAAACTCTTCATTTTGTGAATTATAGAAGCT 

ACTTTTTATAAAGCCATA1T1'T1 W ITAGGGAAACTAAGGAGTGACATAGAA 

CTGATGAATGAGTAAAAGTAAGTTTTGCTGGATTTTTGTAGAACTCTGGA 

CGTTGAGGATTCATTATGCTGTGGTTAACTTTAAATATTTTT 
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SYMBOL KEY 

BOLD = Eukaryotic Leader Sequence 

= Myristylation Site 

lower case = Membrane Anchoring Domain (MAD) 
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